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ABSTRACT 

Personalalized PageRank uses random walks to determine 
the importance or authority of nodes in a graph from the 
point of view of a given source node. Much past work has 
considered how to compute personalized PageRank from a 
given source node to other nodes. In this work we con- 
sider the problem of computing personalized PageRanks to 
a given target node from all source nodes. This problem can 
be interpreted as finding who supports the target or who is 
interested in the target. 

We present an efficient algorithm for computing personal- 
ized PageRank to a given target up to any given accuracy. 
We give a simple analysis of our algorithm's running time 
in both the average case and the parameterized worst-case. 
We show that for any graph with n nodes and m edges, if 
the target node is randomly chosen and the teleport prob- 
ability a is given, the algorithm will compute a result with 
e error in time O (— (— +log(n))). This is much faster 
than the previously proposed method of computing person- 
alized PageRank separately from every source node, and it is 
comparable to the cost of computing personalized PageRank 
from a single source. We present results from experiments 
on the Twitter graph which show that the constant factors 
in our running time analysis are small and our algorithm is 
efficient in practice. 



1. INTRODUCTION 

Personalized PageRank is a random-walk based method 
of modeling how nodes are related in a graph like a social 
network, the web graph, or a citation graph. It has been 
used in a variety of application including personalized search 
18], link prediction 19] p], link-spam detection B], and graph 
partitioning [2]. Previous work has considered how to com- 
pute personalized PageRank from a single source node. In 
this work, we consider the problem of computing personal- 
ized PageRank to a single target node from all source nodes. 
More precisely, given a node v in a directed (or undirected) 



graph G — (V,E), we would like to approximate the per- 
sonalized PageRanks n(u, v) from all nodes u G V to the 
target node v. We define the personalized PageRank tt(u, v) 
from a node u to a node v to be the fraction of time we 
spend at v on a random walk from u, where after each step 
we stop with a given probability a. Note that this is dif- 
ferent from reversing the edges and computing personalized 
PageRank from a single source v. If edges represent inter- 
est, this problem can be interpreted as finding the nodes u 
which are interested in v, or if edges represent support this 
problem can be interpreted as finding nodes which support 
v. 

This problem has several applications. In a social network, 
for example, whenever v produces content, we might want to 
find the nodes u with n(u, v) above some threshold and add 
the content to each such it's feed. Or an advertiser uona 
social network might want to give special offers to the nodes 
u which are most interested in it. The use of personalized 
PageRank in recommendation and trust systems is discussed 
in [I]. On the web graph, this problem has been considered 
before. In p], the first phase of the authors' algorithm to 
detect when a web page v is benefiting from link-spam is to 
compute the set of nodes u with a high value of tt(u, v). 

The simplest solution to this problem is to compute per- 
sonalized PageRanks from every source node u using known 
methods like Monte Carlo km p] or power iteration HO] . This 
is the solution proposed in M] , the only previous work we are 
aware of which has considered this problem. In H] , the cost 
of computing personalized PageRank from every source us- 
ing Monte Carlo is amortized because there are a large num- 
ber of target nodes v. However, this simple solution requires 
n Monte Carlo computations even for a single target v. As 
shown in [6] using the Chernoff bounds, computing an e ap- 
proximation to ir(u, v) with high probability from a single 
source u to all v takes O (\ log (n)) time. Thus even for a 
single target this approach would take O (n-^i log (n)) time. 
The challenge we address is finding an algorithm which can 
find all nodes u that have high values of n(u,v) without 
doing work linear in n. 

We present an algorithm which, given v, approximates 
tt(u, v) for all u to within a given additive error without 
needing to visit all the nodes. Our method is to start at the 
target node v and propagate updated estimates of 7t(m, v) 
backwards along edges. In power iteration, every node prop- 
agates its current value in every step. The key idea of our 
algorithm is to maintain a priority queue and only prop- 
agate the value of the node whose value has changed the 
most since its value was last propagated. It is very simple 



to implement: the entire algorithm is shown in AlgorithmfT] 
We prove that it is efficient both in an average-case and a 
parameterized sense. We also present experiments on part 
of Twitter's graph to show that it is efficient in practice. 
In this work we assume that a single processor is used for 
each target v and that the graph is stored in local or dis- 
tributed RAM. If there are multiple targets, parallelism can 
be achieved by assigning different targets to different pro- 
cessors. 

The contributions of this work are the following: 

• In section [5] we present a simple algorithm for com- 
puting personalized PageRanks to a target node up to 
any given additive error. In section [5T] we analyze the 
approximation error of the algorithm and prove it is 
correct. 



• In section [572| we show that for an arbitrary graph G 
and a target node v chosen uniformly at random, our 
algorithm runs in time 
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where — is the average degree of a node, e is the desired 
additive error, and a (typically between 0.1 and 0.2 in 
practice) is the probability of stopping after each step 
of the walk. This is comparable to the cost of running 
Monte Carlo from a single source node, O (\ log(ro)) 
for high success probability, and it is much less than 
the cost of running Monte Carlo from every source 
node, O (n-^ log(n)) . 



• In section [573] we show that for an arbitrary graph G 
and arbitrary target node v, our algorithm runs in time 
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where D v {ae) = £ u:7rKt , )>ae (|IN(m)| + log(n)) is a 
parameter which captures how difficult the problem is 
for v. This shows the asymptotic dependence on e is 
O (log (-)), and not O (4r) as it is for Monte Carlo 
from a single a single source. Even if e is small enough 
that we must consider the entire graph, D v (ae) < 
m + n\og(n) < 2m for graphs with m < nlog(n). 
For such graphs as e goes to zero the algorithm de- 
grades gracefully to the asymptotic performance of 
power-iteration, 0(mlog(i)). Thus for larger e we 
get the benefit of only exploring a small set of nodes 
with high personalized PageRank to the target, while 
for small e the running time is still comparable to the 
cost of running power iteration. 

In section [6] we present results from an experiment on 
part of the Twitter graph with 5.3 million nodes and 
380 million edges. We find that our error analysis is 
tight and that D v (ae) is an accurate paramterization 
of the running time. As one example, we find that for 
a approximation of e = 10~ 5 , the priority queue al- 
gorithm takes 1.2 seconds while power iteration takes 
410 seconds to achieve additive error e on the same 
machine. This shows that the local nature of the algo- 
rithm can give significant savings. 



2. RELATED WORK 

Personalized PageRank was first suggested in the original 
PageRank paper [lO], and much follow up work has consid- 
ered how to compute it efficiently. Our approach of propa- 
gating estimate updates is similar to the approach taken by 
Jeh and Widom [8: and Berkin 5 to compute personalized 
PageRank from a single source. Our equation (Tip appears 
as equation (10) in 8 . Both of these works suggest the 
heuristic of propagating from the node with the largest un- 
propagated estimate. Our work is different because we are 
interested in estimating the values n(u, v) for a single target 
v, while earlier work was concerned with the values for a 
single source u. Because of this, our analysis is completely 
different, and we are able to prove running time bounds. 

To the best of our knowledge, the only previous work to 
consider the problem of computing personalized PageRank 
to a target node was by Benczur et al. *4 , where it is used 
as one phase of an algorithm to identify link-spam. They 
observe that a node v's global PageRank is the average over 
all nodes u of tt(u, v). Thus to determine how a node v 
achieves its global PageRank score, they propose we first 
find the nodes u with a high value of ir(u,v). Once that 
set has been found, it can be analyzed to determine if it 
looks like an organic set of nodes or an artificial link-farm. 
To compute the values of n(u,v) for each v, they propose 
taking random walks from every source node and do not 
consider other methods. 



3. PRELIMINARIES 

We are given a directed or undirected graph G — (V, E). 
For now we assume G is unweighted, but in section [5~4] we 
show how our algorithm and theorems generalize easily to 
weighted graphs. We define OUT(u) = {w : (u, w) 6 E} 
and IN(-u) = {w : (w,u) £ E}. We are given a parameter a 
which determines the expected length of a random walk, — . 
For u, v G V, we define personalized PageRank ir(u, v) to be 
the fraction of time we spend at v on the following random 
walk: we start at u and at each step with probability a 
we halt, while with probability 1 — a we transition to a 
random out-neighbor of the current node. We refer to a 
as the teleport probability because another description of 
the Markov chain is the following: the process never halts, 
and at each step with probability a we teleport back to u 
and continue from there, while with probability 1 — a we 
transition to a random out-neighbor of the current node. 

There may be dead end nodes with no out-neighbors in 
the graph, so for convenience we introduce an artificial sink 
node with a self-loop and introduce an artificial edge to the 
sink from each dead end node. Alternatively, we could have 
artificially added a self-edge to each dead end node, or said 
that the walk should halt when it reaches a dead end node. 
These alternatives result in a slightly different boundary case 
or normalization, but the exact choice doesn't matter sig- 
nificantly. 

In the original PageRank paper [To], the authors propose 
that when the random walk for computing PageRank tele- 
ports, the resulting node could be chosen from an arbitrary 
distribution. We focus on the case when the distribution 
has a single point of support, because that is the case rel- 
evant to our applications. The PageRank function is linear 
in the personalization distribution, as shown in 18], so com- 
puting PageRank on single-point distributions is sufficient 



for computing it on arbitrary personalization distributions. 
In the worst case, the target node v might have an edge 
from every other node in the graph, so we must do Q(n) 
work even for a rough approximation. To parameterize the 
difficulty of the problem for a given v, we define 

D v (x)= Yl (|IN(u)|+log(n)). 

This parameter captures the idea that for each node u which 
has a large personalized PageRank to v, we must consider 
all of w's in-neighbors to see if any of them also have a large 
personalized PageRank to v. The log(n) term captures the 
cost of popping from a priority queue. 

In evaluating our approximation we consider additive point- 
wise error (the L°° norm). Given error threshold e, we seek 
an estimate s(u) for each u such that 

max|s(u) — 7r(u, v)\ < e. 

u£ V 

We choose this error measure because in applications we 
are often only interested in the nodes u with a large value 
of 7r(u, v), and we don't care if there are a large number of 
nodes with very small values of n(u, v) which have been esti- 
mated to be 0. For efficiency we want the resulting estimate 
vector s to be sparse unless e is very small, and this norm 
allows for a sparse estimate vector. 

4. A RECURRENCE FOR PERSONALIZED 
PAGERANK 

Our algorithm is based on a recurrence equation that re- 
lates the value of n(u, v) to the values of 7r(w, v) for w £ 
OUT(it). To derive this recurrence, it is convenient to think 
about the number of times we visit v on a random walk 
rather than the fraction of time we spend at v. The number 
of times we visit v is proportional to the fraction of time 
tt(u, v) because over a large number of walks, the number 
of times we visit v will be the fraction of time we spend at 
v multiplied by the average length of a walk, — . A ran- 
dom walk from u begins by either teleporting immediately 
or by transitioning to a random neighbor, so the expected 
number of times we reach v from u is the probability of not 
teleporting immediately times the average expected num- 
ber of times we reach v from an out-neighbor of u. Thus 
personalized PageRank satisfies the recurrence 
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We add a when u — V because a walk from v clearly visits 
v on its first step regardless of what happens next, and this 
visit corresponds to an a-fraction of an average walk. This 
equation appears as equation (10) in [8], where the authors 
give an alternate proof using linear algebra. 

5. THE PRIORITY QUEUE ALGORITHM 

Given a target node v, our algorithm is based on the idea 
of propagating updates outwards from v. We maintain for 
each node u a score s(u) which estimates 7r(u, v) from be- 
low and improves as the algorithm progresses. Using the 
recurrence of equation |l| , we see that when we update our 
estimate n(w, v) for some node w, we need to update our 



estimate of n(u,w) for each u € IN(tu). Hence the basic 
update step of the algorithm is to choose a node w and in- 
crease the score of each in- neighbor it by ( 1 — a) ou ^ , -, s(w). 
Since we might propagate a node w's score more than once, 
it is important that we only propagate the part of w's score 
which changed since the last time w's score was propagated. 
We let p(w) denote the difference between w's current score 
s(w) and w's score when its score was last propagated. We 
use a priority queue Q ordered by priority p(w) so we can 
easily find the node with the largest value of p(w). The com- 
plete algorithm is shown in Algorithm [l] as long as some 
node has priority above a minimum threshold, we pop off 
the node with the greatest priority and propagate its score 
to its in-neighbors. 

Algorithm 1 Computing personalized PageRank to a tar- 
get^ 

Input: digraph G = (V, _E),teleport probability a,target 

vertex v, error tolerance e 
Output: Approximation s : V — ¥ [0, 1] to personalized 

PageRank such that for all u, \ir(u,v) — s[u]\ < e 

for u in V do 

s[u] = p[u] — 
end for 
s[v] = p[v] = a 

q — Max Priority Queue on V ordered by key p 
while g.maxPriorityO > a ■ e do 

w = g.popMaxElement() 

for u in w.inNeighbors() do 
Aa=(l-a) — 'P 

\ ' w.outDcgrcc 

s[u] — s[u] + As 
g.increasePriority(ti, p[u] + As) 
end for 
p[w] — 
end while 

An example run of 6 iterations of the algorithm is shown 
in Figure^ 

5.1 Error Analysis 

One key question was the priority threshold at which we 
should stop popping nodes. Initially we considered threshold 
e, but this threshold is not small enough to ensure that all 
errors are less than e. We now show that ae is a sufficient 
threshold, and our experiments show that it is tight. 

Theorem 1 (Correctness). When the priority queue 
alqorithm is run until all priorities are less than ae, the 
resulting score vector s satisfies \s(u) — ir(u,v)\ < e for all 
ueV. 

Proof. After the algorithm has run to completion, let u 
be the node with the greatest additive error and let E = 
|-7r(u, v) — s(u)\ be its error, so for all nodes w, |-7r(w,i;) — 
s(w)| < E. Recall equation ffl: 



n(u,v) = (1 — a) 



OUT(u)| 



^~1 n(w,v) + 



meOUT(u) 



a, u = v 
0, u ^ v 



When the algorithm has completed, a node u's score is equal 
to the sum of the amount of score it has received from each 
out-neighbor. An out-neighbor w has final score s(w) and 
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Figure 1: The first six iterations of the priority queue algorithm run on a simple graph with four nodes and 
target node v. The node which will propagate its priority next is shown with a dark background. The score 
s for a node u is our current estimate of n(u,v), and the priority p is the amount of unpropagated score. 



final un-propagated score p(w), so the amount propagated 
is s(w) — p(w). This gives us the following: 



s(u) — (1— a) 



OUT(u) 



Y^ (s(w)-p(w))+ 



ioeOUT(u) 



a, u — v 
0, u 7^ v 



where p(w) < ae is the part of to's score which has not been 
propagated back to u. Subtracting these two equations, we 
see that 



E — \tv(u, v) — s(u)\ 
= (!-<*)■ ' 



OUT(u) 



E \ir(w,v) - s(w) 



n>60UT(u) 



+ (l-o) 



1 



|OUT(u)| 



E p( % 



meouT(u) 
< (1 -a)E+ (l-a)ae 

where we've used the fact that for all nodes w, \tt(w,v) — 
s ( w )\ < E and p(w) < ae. Isolating the error E we conclude 
that 



E< (l-a)e< e. 



D 



5.2 Average Running Time 

Next we analyze the running time of this algorithm. In the 
worst case, our target node v could have a high personalized 
PageRank from every other node, forcing us to consider the 
entire graph and do Q(m) work. Thus to give a useful bound 
on the running time, we give both an average case analysis 
and a worst-case parameterized analysis. First we analyze 
the priority queue algorithm in the average case where the 
target node v is chosen uniformly at random. 

Theorem 2. Let an arbitrary graph G, additive error tol- 
erance e, and teleport probability a be given. Let n be the 
number of nodes in G and m be the number of edges. If v is 
chosen uniformly at random from V , then the priority queue 
algorithm runs in expected time O (— (— + log(n))) steps. 

Proof. Suppose we ran the algorithm once for every v £ 
V. When the target node is v, the number of times a node u 
can be popped from the queue is at most ^ > since its 

priority decreases by at least ae each time it is popped and 
the total accumulated priority is at most 7t(m, v). The time 
to propagate the score from a node u is 0(jIN(u)|) steps 
since each of its in-neighbors must receive some of its score. 
We also must do 0(log(n)) work to pop the maximum node 
from the priority queue. Thus the running time for all n 
nodes is at most 



EE 



7r(w, v) 



0(|INw|+log(n)) 






^^ ]1 ^ 1 0(\Wu\+ log(n)) 

U V 

= V — OfllNul+logfn)) 
*— ' ae 

u 

= q f m + n\og{n) 
\ ae 

and the average running time per node is as claimed. D 



In the appendix we prove a bound which is tighter for 
e > i in the case when the personalized PageRanks from 
each source follow a power law. 

Because we perform a large number of increase-priority 
operations on the priority queue, the best asymptotic time 
is achieved by using a Fibonacci heap for the priority queue. 
With a Fibonacci heap [7] , we can increase a node's priority 
in constant amortized time, so in the above analysis the 
cost of 0(|IN(w)|) increase-priority operations is 0(|IN(«)|). 
In our experiments we use a standard binary-heap priority 
queue for simplicity. 

Also note that our average running time analysis did not 
use the fact that we are using a priority queue. The same 
time bound would hold if we simply maintained the set of 
nodes u with 7r(u) > ae and repeatedly popped an arbitrary 
element of this set. This is an alternative implementation of 
the algorithm which avoids the cost of the queue. By remov- 
ing the cost of the priority queue from the above analysis 
we see that this alternative runs in time 
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Our parameterized running time analysis does use the pri- 
ority queue property to improve the dependence on e from 
1 to log (1). 

Comparison with Monte Carlo In 4 , the authors sug- 
gest computing values of 7r(u, v) to a given target v by taking 
Monte Carlo walks from every other node. As shown in [6] 
using the Chernoff bounds, computing an e approximation 
of 7t(m, v) for a single source u and with failure probability 
S takes 



e 



■log 



steps. If the graph is sparse enough or e is small enough, 
our average time bound of 



o 



1 m 



is better than this. Thus to compute an e approximation 
of personalized PageRank for all pairs of nodes, running the 
priority queue algorithm to every node is a viable alternative 
to running Monte Carlo from every node. 

5.3 Parameterized Running Time 

Next we give a paramterized bound that applies to arbi- 
trary graphs and arbitrary target node v. As in the prelim- 
inaries section, we define 

D v (x)= J2 (|IN(u)| + log(n)) 

u:iv(u,v)>x 

to capture the difficulty of computing personalized PageR- 
ank to the target v. 

Theorem 3. If the priority queue algorithm is run with 
teleportation probability a, target node v, and additive error 
e, it takes time 

of^Miogfl- 

\ a \ea 

Proof. We divide the execution of the algorithm into 
log 2 (— ) stages, where in stage i it pops nodes with priority 
greater A until all nodes have priority less than A After 



stage i has completed, by TheoremfT] the difference between 
the score of a node u and the true value of ir(u, v) is at most 
J— . This implies that each node u can be popped at most ^ 
times in each stage, since each pop in stage i+1 decreases the 
difference between s(u) and n(u,v) by at least ^i+t- Each 
time a node u is popped we do 0(|IN(it)|) work increasing 
priorities and 0(log(n)) work popping the node from the 
priority queue. This gives us a running time of 
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= o(-D v (ae)log( — 
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5.4 Extension to Weighted Graphs 

We assume that the graph is unweighted for simplicity, but 
our algorithm extends immediately to the case of a weighted 
graph, in which weight [it] [w] is proportional to the proba- 
bility of transitioning from node i to node j on a random 
walk. In this case, the change in score in Algorithmfllshould 
become 



(1 



p[w] weight [u] [w] 



u.weightedOutDegree 

where tt.weightedOutDegree is defined as ~^2 w weight [u][w] 
Similarly, the power iteration equation (TlJ> should become 

7r(w, v) — (1 — a)- p— ^ YJ weight [u][v] • tt(w,v) 



+ 



cIout(m) 
u — V 



«j€OUT(o) 



where c(out = X)™eouT(u) weight [it] [a] is the weighted out- 
degree of u. All the proofs can be modified similarly. The 
theorem statements remain the same. 

6. EXPERIMENTS 

For our experiments, we used a part of the Twitter fol- 
lower graph with 5.3 million nodes and 389 million edges. 
We ran an experiment for each setting of parameters in the 
Cartesian product of teleport probability a € {0.1, 0.2} and 
additive error e £ {10 -4 , 10~ 5 , 10 -6 }. For each experiment 
we chose 100 target nodes v uniformly at random and ran 
the priority queue algorithm. Since nodes with high global 
PageRank might be targets more often than other nodes, we 
repeated the above setup sampling 100 target nodes with 
probability equal to their global PageRank. We measured 
the number of steps (defined as the number of times we up- 
dated some node's priority in the inner loop), the change in 
wall-clock time, and the maximum error. We measured the 
maximum error by running power iteration, equation |l]), 
until convergence and comparing the result pointwise with 
the result of the priority queue algorithm. 

We first note that our error analysis is tight. For effi- 
ciency, we want to do as few operations as possible to reach 
our desired error tolerance e. If our empirical error was 
much lower than our target error e, it would indicate that 
we were wasting effort achieving an accuracy which is finer 
than required. 

However, on the Twitter graph there are nodes with em- 
pirical error 0.85e, which is quite close to our proven bound 
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Figure 2: The empirical error of our algorithm af- 
ter convergence. To obtain this data, we set tele- 
port probability a — .1 and error threshold e = 10 -6 , 
choose 100 target nodes v with probability equal to 
their global PageRank, ran the priority queue al- 
gorithm to obtain scores s, and then computed the 
empirical error max tlg v|7r(w, v) — s(u)\. The x-axis is 
empirical error divided by e. Notice that most nodes 
have an error which is a large fraction of e, showing 
that our error bound is tight in practice. 



of e. A histogram of the empirical errors for e = 10 and 
targets sampled from the global PageRank distribution is 
shown in Figure [2] Notice that for these parameters the 
empirical error is often more than 50% of the proven bound 
e. 

We compare our actual average running time to the bound 
from Theorem |2l — — steps, and find that the algorithm 
actually runs faster than the bound requires. For a = 0.2 
and all three values of e, the algorithm uses less than 3% of 
the number of steps the bound allows. For a = 0.1, and all 
values of e the algorithm uses less than 20% of the number of 
steps the bound allows. A histogram of the running times is 
shown in figure [3] Notice that the step-axis is log-scale, and 
most nodes use far fewer steps than the bound represented 
by the vertical line allows. 

Our parameterized analysis shows that the number of 

steps needed is at most O I ' '^ log (^M ). To measure 
how tight this is, we compared the number of steps taken to 
D v . To use more adversarial v, we sampled v from the global 
PageRank distribution instead of uniformly at random, so v 
with high global PageRank will be chosen more often. We 
found that in practice D v (ae) is an excellent predictor for 
the number of steps, and that the constant of proportion- 
ality in practice is much less than i log (^-). For example, 
with a = 0.1 and e = 10 -5 , the proven ratio between step 
count and D v (ae) is 200, but in our experiment the average 
ratio is less than 4. The distribution of ratios is shown in 
Figure [4] Note that for most nodes, the number of steps 
taken is within a factor of 2 of D v (ea) even though the ab- 
solute number of steps varies on an exponential scale, as 
shown in Figure [3] 
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Figure 3: The number of steps required to reach 
convergence, defined as the number of times we up- 
dated some node's priority in the inner loop. To ob- 
tain this data, we set teleport probability a = .1 and 
error threshold e = 10 J , choose 100 target nodes 
v uniformly at random, and ran the priority queue 
algorithm. The vertical line indicates the average 
running time bound of Theorem [2l Notice that the 
x-axis is log-scale, so most nodes require many fewer 
steps than the bound allows. 
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Figure 4: The number of steps required to reach 
convergence compared to the parameter D v (ae). To 
obtain this data, we set teleport probability a = .1 
and error threshold e — 10 , choose 100 target 
nodes v sampled from the PageRank distribution, 
and ran the priority queue algorithm. Notice that 
for most nodes, the number of steps taken is within 
a fraction of 2 of D v (ea) even though the absolute 
number of steps varies on an exponential scale, as 
shown in Figure [3l 



Figure 5: The average wall-clock running time of our 
algorithm compared to power iteration. To obtain 
the first column, we set teleport probability a — .1, 
choose 100 target nodes v uniformly at random, and 
ran the priority queue algorithm until completion. 
To obtain the second column, we measured the time 
for an average iteration of applying equation (TTT) and 
multiplied by log 1 _ Q (^), the number of iterations 
needed for accuracy e. Notice that by propagat- 
ing the largest changes first, our algorithm is much 
faster than power iteration. 



Now we compare our algorithm's performance to power 
iteration and observe the benefit of only visiting a set of 
nodes around v. It can be shown that equation ([l]) is a con- 
traction mapping with contraction ratio 1 — a. Thus one 
alternative to our priority queue algorithm is to apply equa- 
tion <TlJ> repeatedly. Using the contraction map property, to 
guarantee additive error e we must do log 1 _ a - iterations. 
In our experiment, we also computed personalized PageR- 
ank to each target using power iteration in order to measure 
the empirical error. We found that each iteration of apply- 
ing equation (II]) took 3.9 seconds, and this was stable over 
a large number of iterations (the graph is too large for the 
processor cache to help). Because our algorithm explores 
only a neighborhood around v, it is often much more effi- 
cient than power iteration. For example, when a = 0.1 and 
e = 10 -4 , our algorithm took 0.2 seconds on average, which 
is 1700 times faster than the 87 iterations needed to guar- 
antee at most e error. For smaller e, our algorithm is forced 
to consider more of the graph, so its relative advantage di- 
minishes. For the smallest value of e we tried, e = IO -6 , 
our algorithm took 30 seconds on average, while power iter- 
ation takes 3.8 log^^ - ~ 500 seconds. A table of running 
times for the two algorithms is shown in Figure [5] As e 
tends to zero, our algorithm's performance degrades grace- 
fully to within a constant factor of the performance of power 
iteration, as proven in Theorem [3] 
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APPENDIX 

A. AVERAGE RUNNING TIME FOR POWER 
LAW GRAPHS 

We can get a better bound on the expected running time 
if we assume a power law on the personalized PageRank 
values: suppose that for each u there is some /3 £ (0, 1) such 
that if we order the nodes vi, . . . ,v„ in decreasing order of 
n(u,Vi) then 

Tv(u,Vi) = rji~P 



for some constant r\. Since £7 n(u, v) 
is determined by /3: 



1, the value for n 



r/: 







7, 1 -/ 3 



Such a power law was observed empirically on the twitter 
graph in *3| with /3 ~ 0.75. For simplicity we assume that 
all nodes have the same exponent /3. 



Theorem 4. For a graph G in which the personalized 
PageRanks from each node follow a power law with expo- 
nent j3, if v is chosen uniformly at random from V , then the 
priority queue algorithm runs in time 



O 



1 

where m is the number of edges in the graph. 

Proof. Suppose we ran the algorithm once for every 
v 6 V. As in the average-case analysis proof, the run- 
ning time is at most ^2 u ^ |INw|. With the power 

law assumption, the majority of nodes will not be popped 
even once because n(u, v) < at. The largest i such that 
Ti{u,Vi) — r\i~^ > ae is 



\aeJ 



so we only need to visit this many distinct nodes v for each 
u. The running time for all n nodes is thus at most 



EE 



7r(w, v) 

ae 



\wu\<yy r ^- 

' ^^ ae 



lINul 



ae 
< m 



ElPN* 



dx 



r\ i t 



(3+1 



ae-/3 + l 

Substituting in the value of i* and r\ we see that the total 
runtime for all n target nodes is 



11 rf 



,(M 



where c = (1 — 0)\P X J so the average running time per 
node is as claimed. □ 



